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STEIN'S METHOD FOR DISCRETE GIBBS MEASURES 1 
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Stein's method provides a way of bounding the distance of a prob- 
ability distribution to a target distribution /Li. Here we develop Stein's 
method for the class of discrete Gibbs measures with a density e v , 
where V is the energy function. Using size bias couplings, we treat an 
example of Gibbs convergence for strongly correlated random vari- 
ables due to Chayes and Klein [Helv. Phys. Acta 67 (1994) 30-42]. 
We obtain estimates of the approximation to a grand-canonical Gibbs 
ensemble. As side results, we slightly improve on the Barbour, Hoist 
and Janson [Poisson Approximation (1992)] bounds for Poisson ap- 
proximation to the sum of independent indicators, and in the case of 
the geometric distribution we derive better nonuniform Stein bounds 
than Brown and Xia [Ann. Probab. 29 (2001) 1373-1403]. 

0. Introduction. Stein [17] introduced an elegant method for proving 
convergence of random variables toward a standard normal variable. Bar- 
bour [2, 3] and Gdtze [10] developed a dynamical point of view of Stein's 
method using time-reversible Markov processes. If [x is the stationary dis- 
tribution of a homogeneous Markov process with generator A, then X ~ [i 
if and only if EAg(X) = for all functions g in the domain T>{A) of the 
operator A. For any random variable W and for any suitable function /, to 
assess the distance \Ef{W) — f f dfi\ we first find a solution g of the equation 

Ag{x)=f(x)- J fdfi. 

If g is in the domain T>(A) of A, then we obtain 



(0.1) 



Ef(W)- Jfdfi 



\EAg(W)\. 
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Bounding the right-hand side of (0.1) for a sufficiently large class of func- 
tions / leads to bounds on distances between the distributions. Here we will 
mainly focus on the total variation distance, where indicator functions are 
the test functions to consider. 

Such distributional bounds are not only useful for limits, but they are also 
of particular interest, for example, when the distance to the target distri- 
bution is not negligible. Such relatively large distances occur, for example, 
when only few observations are available, or when there is considerable de- 
pendence in the data slowing down the convergence. The bound on the dis- 
tance can then be taken into account explicitly, for example when deriving 
confidence intervals. 

In Section 1 we introduce Stein's method using the generator approach for 
discrete Gibbs measures, which are probability measures on No with proba- 
bility weights /J,(k) proportional to exp(V(k)) for some function F:No — >R. 
The discrete Gibbs measures include the classical distributions Poisson, bi- 
nomial, geometric, negative binomial, hyper geometric and the discrete uni- 
form, to name but a few. One can construct simple birth-death processes, 
which are time-reversible, and which have a discrete Gibbs measure as its 
equilibrium measure. In the context of spatial Gibbs measures this connec- 
tion was introduced by Preston [15]. 

In Section 2 we not only recall bounds for the increments of the solution 
of the Stein equation from [5], but we also derive bounds on the solution 
itself, in terms of potential function of the Gibbs measure; see Lemmas 2.1, 
2.4 and 2.5. The bounds, which to our knowledge are new, are illustrated 
for the Poisson, the binomial and the geometric distribution. 

For nonnegative random variables, the size bias coupling is a very useful 
approach to disentangle dependence. Its formulation and its application to 
assess the distance to Gibbs measures are described in Section 3. We compare 
the distributions by comparing their respective generators, an idea also used 
in [11], while paying special attention to the case that the domains of the 
two generators are not identical. The size bias coupling then naturally leads 
to Theorem 3.5 and Corollary 3.8. 

Section 4 applies these theoretical results to assess the distance to a Gibbs 
distribution for the law of a sum of possibly strongly correlated random 
variables. The main results are Theorem 4.2 and Proposition 4.4, where we 
give general bounds for the total variation distance between the distribution 
of certain sums of strongly correlated random variables and discrete Gibbs 
distributions of a grand-canonical form; see [16], Section 1.2.3. In particular, 
Theorem 4.2 gives a bound on the rate of convergence for the qualitative 
results in [6] by bounding the rate of convergence. 

Considering two examples with nontrivial interaction, we obtain bounds 
to limiting nonclassical Gibbs distributions. Our bound on the approxima- 
tion error is phrased in terms of the particle number and the average density 
of the particles. 
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Summarizing, the main advantage of our considerations is the application 
of Stein's method to models with interaction described by Gibbs measures. 
When applying our bounds on the solution of the Stein equation to the 
Poisson distribution and the geometric distribution, surprisingly we obtain 
improved bounds for these well-studied distributions. Thus our investigation 
of discrete Gibbs measures serves also as a vehicle for obtaining results for 
classical discrete distributions. 

The results presented here will also provide a foundation for introducing 
Stein's method for spatial Gibbs measures and Gibbs point processes, in 
forthcoming work. 

1. Gibbs measures and birth death processes. 

1.1. Birth-death processes. A birth-death process {X(t),t G R} is a Markov 
process on the state space {0,1,..., N}, where N G No U {oo}, characterized 
by (nonnegative) birth rates {bj,j G {0, 1, . . . , N}} and (nonnegative) death 
rates {dj,j G {1, .. . ,N}} and has a generator 

(1.1) (Ah)(j) = bj{h(J + 1) - h(j)) - djihU) - h(j - 1)) 

with j G {0,1,..., N}. 

It is well known that for any given probability distribution [i on No one can 
construct a birth-death process which has this distribution as its station- 
ary distribution. For N = oo, recurrence of the birth-death process X(-) is 
equivalent to J2n>i ti-'-t"' = 00 • P rocess 1S ergodic if and only if 

the process is recurrent and c := 1 + X)n>i ^i.-d 1 < 00 ■ ^ or < oo, irre- 
ducibility and hence ergodicity holds if bj > 0, j = 0, 1, . . . , N — 1, b^ = 
and dj > 0, j = 1, . . . , TV. 

In either case the stationary distribution of the ergodic process is given 
by /x(0) = 1/c and 

(1.2) M(n) = M (0)^%i. 

For any given probability distribution /x on No these recursive formulas give 
the corresponding class of birth-death processes which have fi as the sta- 
tionary distribution. For the choice of a unit per capita death rate dj = j one 
simply obtains that 

(1.3, ^^^W 

for j < N — 1. Here and throughout, if N = oo, then by j < N — 1 we mean 

j = 0, 1, The choice of these rates corresponds to the case where the 

detailed balance condition 

(i.4) M (j)^=/i(i + iH + i, j = o,i,...,iv-i, 
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holds; see, for example, [1] and [12]. We will apply these well-known facts to 
the discrete Gibbs measure introduced in the next subsection. 

Now might be a good time to note that not all probability distributions 
on No are given in closed form expressions; notable exceptions occur, for 
example, in compound Poisson distributions. 

1.2. Gibbs measures as stationary distributions of birth-death processes. 
Gibbs measures can be viewed as stationary measures of birth-death pro- 
cesses, as follows. 

We start with a discrete Gibbs measure fi; for convenience we assume 
that [i has support supp( / u) = {0, . . . , N}, where TV G No U {oo}, so that \x is 
given by 

(1.5) ^ k) = T V{k) JT> k = 0,l,...,N, 
for some function V : No — > R. Here Z = J2k=o ex P 

(^(fe))^-, and u > is 

fixed. We assume that Z exists; Z is known as the partition function in 
models of statistical mechanics. We set V{k) = — oo for k > N. In terms of 
statistical mechanics, \i is a grand-canonical ensemble, u> is the activity and 
V is the potential energy; see [16], Chapter 1.2. 

The class of discrete Gibbs measures in (1.5) is equivalent to the class of 
all discrete probability distributions on No by the following simple identifi- 
cation: For a given probability distribution (fj,(k))keN we have 

(1.6) V (k)= log fi(k) + log k\ + log Z -kloguj, k = 0, 1, . . . , N, 

with V(0) = logjti(O) + logZ. Hence Z = e ^/Q\ ■ The latter formula gives the 
possibility of proving convergence for a sequence of partition functions Z n 
by using the convergence of the corresponding sequence fj, n (0). 

However, the representation of a probability measure as a Gibbs mea- 
sure is not unique. For example, the Poisson distribution with parameter 
A, Po(A), can be written in the form (1.5) with u = A, V(k) = —A, k > 0, 
Z = 1. Alternatively, we could have chosen V(k) = 0, u> = A, Z = e x . 

From (1.3), if we choose a unit per capita death rate = k, and if we 
choose the birth rate 

V(k+1)-V{k) _ n. , + 



(1.7) b k = ue v ^- v ^ = (k + l) 

/i(fc) 

for k, k + 1 G supp(/x), then 

(1.8) (Ah)(k) = (h{k + 1) - h{k))uje v{ - k+l) - v{ - k) + k(h(k - 1) - h(k)), 

for k, k + 1, k — 1 G supp(/i), k G N [set h(— 1) =0], is the generator of the 
time-reversible birth-death process with invariant measure 
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Note that this choice of dk and bk ensures that the detailed balance con- 
dition (1.4) is satisfied. Hence we have chosen a birth-death process with 
generator of the type (1.1) which is easily seen to be an ergodic process. 
Namely, if N = oo, then the recurrence of the corresponding process is given 
since 

4^ h • • • b n ~ ^ (n + l)u(n + I) ~ ^ n + I ~ °°' 

ri>\ 1 " n>l v ' ri>\ 

For N < oo we have that V(k) > — oo for k E supp(/i), so that > for 
k = 0, . . . , N — 1, and as V(N + 1) = — oo, we obtain b^ = 0. Due to the unit 
per capita death rate the ergodicity follows. From € supp(^) we obtain 
c = l//i(0) < oo. Hence fj, is indeed the unique stationary distribution of the 
birth-death process. 

In the development of Stein's method unit per capita is a common and 
useful choice for the death rate; see [2]. It is worth noting that there are 
modifications of this choice in [5] and [11]. 

To compare with the approach in Barbour [2] we reformulate the gener- 
ator: let h(k + 1) - h(k) =: g(k + 1); then (1.8) yields 

(1.9) (Ag)(k) =g{k + l)uje v{ - k+l) - y{k) - kg(k), k = 0, 1, . . . , N. 

The generalization to case of arbitrary death rates d^ is straightforward; 
we omit it here to streamline the paper. 

2. Stein identity for Gibbs measures and bounds. In view of the gen- 
erator approach to Stein's method, for a test function / : supp(/x) — ► R the 
appropriate Stein equation for fj, given in (1.5) is 

(2-1) (Ag)U) = M-tif) 

for j G {0, . . . , N} and A the generator given by (1.8). Here, 

N 

Kf):=J2f(kMk) 

k=0 

is the expectation of / under fx. We are interested in indicator functions 
f(j) = Iy^A] f° r some A C supp(/i). Thus if If is a random variable on 
supp(/x), we obtain 

(2.2) EAg(W) = P(WeA)-fi(A). 

The right-hand side of (2.2) links in nicely with the total variation distance. 
Recall that for P and Q being probability distributions on No, we define the 
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total variation distance (metric) by 

d T y(P,Q):= sup \P{A)-Q{A)\ 

AcN 

= sup|P(/)-Q(/)| 
/eSi 

= \ £ \P({k}) ~ Q({k})l 

fcGN 

where B\ denotes the set of measurable functions / with < / < 1. Hence 
bounding the left-hand side of (2.2) uniformly in A C No gives a bound on 
the total variation distance. 

In the following we give a Stein characterization for /i given in (1.5) and 
a solution of the corresponding Stein equation. 

Let Z be a random variable distributed according to the Gibbs measure \x 
defined in (1.5) and for a function g:No — > R assume E\Zg(Z)\ < 00. Prom 
(1.9) we obtain the Stein characterization for \x that if Z is distributed 
according to /U, given by (1.5), then for every function g: No — > R with 
E\Zg(Z)\ <oo 

(2.3) E{Loe v{z+1) ~ v(z) g(Z + 1) - Zg(Z)} = 0. 

If /:supp(/x) — > R is an arbitrary function, and [i is given by (1.5), then 
there exists a solution gjy : No — > M for (2.1) with operator as in (1.9), 

(2.4) + l)u,e y ( fe+1 )- v ( fe ) - kgjy(k) = f(k) - /,(/), k<N; 

see [5]. This solution g^y is such that <7/,y(0) = 0, and for j = 0, . . . , N — 1 
the solution gyy can be represented by recursion as 

(2-5) g f y(j + 1) = ^e~ V ^ £ ~ M) 

k=0 

(2-6) =-jL e -vU + V £ e V(^ (/W -, (/)) . 

We may set g f y{N + 1) = 0. 

Having a suitable Stein equation for Gibbs measures and its solution at 
our disposal, the next step in Stein's method is to bound the increments of 
the solutions; it will turn out advantageous to bound the solutions them- 
selves as well. For any function g : No — ► M we define 

Ag(j):=g(j + l)-g(j). 

In applications often only bounds on the increments are needed, hence we 
start with these. Uniform bounds on the increments are also called Stein 
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factors or magic factors. Nonuniform Stein factor bounds may yield better 
overall bounds on distributional distances and are therefore of particular 
interest. Lemma 2.1 gives such a nonuniform bound. The proof is given in 
[5], Lemma 2.4 and Theorem 2.1. We introduce the class of functions 

(2.7) B:={/:suppOi)-[0,l]} 
and we define 

k N 

(2.8) F(fc):=£M*), F(fc):=5>(0- 

i=0 i=k 

Lemma 2.1 (Nonuniform bounds for increments). Assume that the death 
rates are unit per capita and assume that the birth rates in (1.7) fulfill, for 
each k = l,2,...,N - 1, 

(2.9) k-^- > > k P{k + 1} 



F(k - 1) ~ ~ F(k) ' 

Let f G B and let g$y be its solution to the Stein equation (2.4)- Then, for 
every j e{0,...,N}, 

(2.10) sup|A^0")l = -e v ^~ v ^F(j + 1) + -F(j - 1). 

fee u j 

Moreover, for every j G {0, . . . , N}, 

1 e v ^ 
sup\Ag fy (j)\ < - A — TTT-TTT- 

feB J we y U+AJ 

Remark 2.2. Condition (2.9) is Condition (C2) in [5], Lemma 2.4. In 
this paper, three more conditions are formulated, which are all equivalent 
to (C2). For example, if the death rates are unit per capita and the birth 
rates are nonincreasing: 

(2.11) e v{k+D-v(k) < e v(k)-v(k-i)^ k = 0,1, . . . , N, 
Condition (C4) in [5] is satisfied and (2.10) holds. 

Remark 2.3. Reference [11] gives an elegant recursive proof of (2.10) 
for a choice of birth rates and death rates which make the method of ex- 
changeable pairs work. In particular, unit per capita death rates are not 
used in her results. 



Under a slightly weaker condition than (2.9) it is possible to derive nonuni- 
form bounds on the solution gty of the Stein equation itself, as follows. 
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Lemma 2.4. Consider the solution g$y of the Stein equation (2.4), 
where f £ B, given in (2. 7). Assume that, for each k = 1, 2, . . . , N — 1, 

(2.12) oje v ^- v ^>k F{k + 1) 



F(k) 

is satisfied. Then we obtain for every j £ {1, . . . , N} that 

\9f,v(j)\ < (min|lnO0,E fc M^)} +^e^°)-^ 1 )^. 

The proof is related to ideas of [4], pages 7-8, used in Poisson approxi- 
mation. 

Proof of Lemma 2.4. For j £ {0, . . . ,N} let Uj = {0,1,... ,j}. We use 
the notation fl^(x) = f(x)l(x £ A). It is easy to see from (2.5) that for 
/ £ B and A C {0, . . . , N}, and for j £ {0, . . . ,N - 1}, 



9f,vU + 1) = ^e-^^MfluM^) ~ Kflu-MUj)} 



and hence 



(2.13) \g fy (j + 1)| < -J^e-n^^UM^)- 

From (2.12) we have that 

-4r^ (j+1) M^) = -t e V ^~ V ^^ 

< ' F(0) 1 1 m _ m F(0) 



,J X F(j + l)k u F(j + 1) 

1 V/i'n1_Vf'1l\ 1 



< ^i n ( i) + I e v(o)-v(i)^ 



^(j + 1) 



Alternatively, 



/ N 1 

\k=l 

( N 1 

= [Y,k^(k) + -e 



V(0)-V(1) 



1 



iV 

V(0)-V(1) 



^(i + 1) 
1^(7+1)' 
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We now prove a crude but usable bound for the supremum norm Hff/yll 
which does not require (2.9) or (2.12) to be satisfied. To this purpose we 
introduce the quantities 

Ai:=u; inf e n*+i)-V(fc) = inf b 

0<k<N-l 0<k<N-l 

(2.14) 

X 2 :=lo sup e n*+i)-V(fc) = sup bkm 

0<fc<7V-l 0<fc<7V-l 

Note that ^ > 1 by construction. 

Lemma 2.5. Consider the solution gty of the Stein equation (2.4), 
where f E B, given in (2.7). Assume that \\ > and A2 < 00. Then 

H^II^ 2 +2(a7+iJ 1(A 2 -2>A!). 

Proof. As the proof follows the ideas of [4], pages 7-8, used in Poisson 
approximation, we only sketch it here. With the notation as for (2.13) we 
obtain the bounds 



\9f,vU + l)\< J^e-^ +1 V(£/ : 



1 J / >k—jA\ 

j n\ 



% 1 (;-/)!■ 

Similarly we have 



|5/y(j + l)l<-^re-^ +1 V(^ 



A V r V(k) - V(i+1) 3 

fe! 



N 



k-j-lJ 



2 k\ 



< E A fe " J 

fc=J+l 

= j! e A U 2 " J " 1 P (A 2 )(C/;). 
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This puts us in the situation of (1.20) and (1.21) in Chapter 1 of [4]. We 
obtain that for j < Ai 

\g f y(j + l)\ <2min(l,Ar 1/2 )<2, 

and for j > A2 — 2 

Thus we have proved the assertion for j < X\ and for j > A2 — 2. If Ai > 
A2 — 2, then our bound covers the whole domain. 

Now assume that S = { |_AiJ + 1, . . . , LA2J — 2} is nonempty. For j £ S we 
have 

\ 9f y(j + 1)1 < ^e- v V+V(ji{U? HS) + n{U? \ S))^) 

<2 + -j| T e-^ +1 )/i([/;n < S) 

LA2J-2 

fc=j+i 

< 2 + j!A 2 "- ? '- 1 e A2 Po(A2){0, . . . , LA 2 J - 2}. 

From [4], Proposition A.2.3(iii), page 259, the Poisson probabilities can be 
bounded as 

Po(A 2 ){0, . . . , LA 2 J - 2} < A2 , Po(A 2 )(LA2j - 2) 

A 2 + I - [A2j 

and so, as LA2J — 1 — j > 1 for j < LA2J — 2, 

l^(, + l)l<2 + i II 4^A^-- 

< 2 + l(j + l)-(LA 2 j-2- J -) A ^A 2 j-2- J 

If A 2 

< 2 + 



2VA1 + I 

1/ A 2 \ A 2^ A i- 2 
<2 + - 



2VAi + l 
This finishes the proof. □ 



Remark 2.6. As Ai and A2 stay invariant under the reparametrization 
uj — > ui = au>, we argue that these are reasonable quantities to employ. 
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Remark 2.7. While the examples below will show that the bound in 
Lemma 2.5 may not be informative, in particular examples better bounds 
may be obtainable in a straightforward manner. Lemma 2.5 is nevertheless 
useful as it gives conditions on the birth rates so that ||<7/,y|| is bounded, 
and these conditions do not involve monotonicity of the birth rates. 

Remark 2.8. Note that neither the last bound in Lemma 2.1 nor the 
bound in Lemma 2.5 use the normalizing constant Z explicitly. 

Again, the generalization of the bounds to the case of arbitrary death 
rates df- would be straightforward. 

A complication arises when we compare two distributions with noniden- 
tical supports. Therefore it will turn out to be useful to consider the follow- 
ing extension from finite to infinite support for a generator A. For conve- 
nience assume that the corresponding measure \x has supp(/i) = {0, 1, . . . , n}, 
for some finite n, so that A is only defined for functions with support 
{0, 1, . . . ,n} [recall that we set g{n + 1) = 0]. We extend A to be defined 
for functions on {0, 1, 2, . . .} as follows: 

(Ag)(k) :=g(k + l)Loe v{k+1) - VW -kg(k), k = 0, 1, . . . , n, 
(2-15) _ 

(Ag)(k) := -kg(k), k>n+l. 

Thus when there are more than n individuals the process is pure death. Now 
if X ~ fj,, then still EAf(X) = for all functions / £ D(A), the domain of 
A, since the operator A represents a birth-death process with the same 
invariant distribution as A; see (1.2). Next we extend the solution of the 
Stein equation, so that for / : {0, 1, . . . , n} — ► ffi, the solution gjy is defined 
by (2.5) for k G {0, 1, . . . and by 

(2.16) g fV ( k ):=^p-, k>n + l. 

[Note that our formula (2.5) would have yielded gfy(n + 1) = 0.] For a 
related suggestion see [4], Chapter 9.2. The above definition ensures that the 
Stein equation (2.4) is still satisfied. However, the bounds on the solution of 
(2.4) change slightly. In contrast to (2.7), let 

(2.17) B := {/ : N - [0, 1], f(x) = for x £ supp(^)}. 

Lemma 2.9. Let A, associated with fi,V, and u be given as in (2.15), 
where \i has supp(/x) = {0, 1, . . . , n}. Let f £ Bo and let gfy be the solution 
of the Stein equation (2.4) given in (2.5) for k <n, and as in (2.16) for 
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k > n + 1. Defining the sum J2?=j+i as f or 3 ^ n > with \\ and A2 6eirz<7 
given in (2.14) for fj,, 

\9f,vU)\ - 2+ 2 (.aTTiJ 1 ( A 2- 2 ^ A i)' J = 0,l,...,n; 

. ,. 1 
\gf,v{j)\<-, j>n + l. 

In the case that (2.12) is satisfied we also have 

\9f,v(j)\ < f nun jln(j),5>Mfc) j + J^- y j = 0, 1, . . . ,n, 

and if (2.9) is satisfied, then 

1 e v ® 



sup|A^ v (j)|< 3 ^ 



/ea 



J > n. 



Proof. The first assertion follows directly from Lemma 2.5 and (2.16). 
The second assertion follows from Lemma 2.1 for j < n — 1, and for j > n + 1 
it follows from (2.16) that 

9f,v(j + 1) - 9f,vU) = M/) <r 



3 + 1 i 



so that 



For j = n we obtain 



g fy (n + 1) - 5/ , v (n) = m(/)-^t " {--(/(") " M/)) 

n + 1 L « 



77 

so that 



L/ /(n)-M/) 



n + iy' 



b/,v(^ + l) -9f,v(n)\ < -■ □ 
We conclude this section with some examples. 

Example 2.10 (Poisson distribution Po(\) with parameter A > 0). We 
use oj = A, V{k) = — A, Z = 1. The Stein operator is 

(Ag)(k)=g(k + l)X-kg(k). 
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Applying Lemma 2.1 (b^ = A is nonincreasing in k) gives the nonuniform 
bound 

(2.18) sup|A 5/iPOS (/c)| < j A-, 

feB k A 

which leads to the well-known uniform bound 1 A 1/A; see [4], Lemma 1.1.1. 
Yet [4], Lemma 1.1.1 gives the better bound A _1 (l — e~ A ) < min(l, 1/A). In 
the Poisson case the right-hand side of (2.10) gives 

k-l W oo U 

1=0 l=k+l 



This sum can be bounded by 




A 

and therefore we obtain for any k > 1 



l) = Ul-e- x ) 



1 



e 



(2.19) sup|A 5/!pos (fc)| <-(l 

feB A 

as in [4], Lemma 1.1.1. The bounds (2.18) and (2.19) lead to 

(2.20) sup|A 5/jPOS (A:)| < \ A ^(1 - e~ A ), 
feB k A 

which is a slight improvement of (2.18) and of [5]. 

For the Poisson distribution, Ai = A2 = A in Lemma 2.5, and the bound 
2 from Lemma 2.5 is poor compared to the bound \\g\\ < min(l, A^ 1 / 2 ) from 
Lemma 1.1.1 on page 7 in [4]. The bound in Lemma 2.4 is cumbersome to 
compute; but for A > y/2 and for j = 1, say, the nonuniform bound |<?(1)| < 
(A(l - e~ A ))^ 1 is slightly more informative than the uniform bound \\g\\ < 
'p^g nonuniform bound improves with increasing A, and deteriorates 
with increasing j. 

Example 2.11 (Binomial distribution with parameters n and <p < 1). 
We use u = ^,V(k) = -log((n - k)\), and Z = (n!(l - p) n ) _1 . The Stein 
operator is 

(Ag)(k)=g(k + l)^—^--kg(k). 

In [4] and [7] we find (Ag)(k) = g(k + l)p(n — k) — (1 — p)kg(k) as the op- 
erator which is equivalent to our formulation. The birth rates (6fe)fe are 
nonincreasing and we obtain from Lemma 2.1 that 

sup|Ac/ /jbin (fc)| < 1 A 



feB (l-p)k p(n-k) 
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The proof of [4], Lemma 9.2.1, implicitly contains the same nonuniform 
bound. Formula (18) in [7] gives a bound of the form 

sup | At// bin (fe) | < (1 - P n+l - (1 - P) n+1 )/((n + 1)(1 - p)p) 
feB 

for every < k < n. Simple calculations show that the bound (18) in [7] is for 
some cases better and for some cases worse in comparison to our nonuniform 
result. From Lemma 2.5 with Ai = 1, A2 = n we obtain a bound on 1 1 <? /,t.in 1 1 
which is of order 0(e nlnn ) and therefore in most applications not useful. 
The nonuniform bound Lemma 2.4 



k=0 



g(j)\ < ( 1- E ( k ) pk{l - pT ~ k J ( min {ln(j),np} 



np 



will still be informative for j small and np large. 



Example 2.12 (Geometric distribution). Consider /u,(k) =p(l —p) k for 
k = 0, 1, The Stein operator is 

(Ag){k) = g{k + 1)(1 -p){k + 1) - kg(k). 

The birth rates = (1 — p)(k + 1) fulfill — b^_i < fc — (k — 1) = — dfc-i- 
This is condition (C4) in [5], which is sufficient for (2.9). Hence applying [5], 
Theorem 2.10, one obtains 

sup \Ag ftgco (k)\ <jA — 

which leads to the uniform bound 1 A 1, 1 \ . 

We can improve their bounds calculating the right-hand side of (2.10) 
explicitly: 

(I-PV , 1,. ,, y, j + l-(l-p>' 

'7TT + j"-"- f ''»° + ■ 

Obviously \Agfy(j)\ < i. Using Bernoulli's inequality (1 — x)™ > 1 — nx for 
x < 1 and n € N it follows that 

|A W ,,o)i<-(- Tiy =- rT . 

We obtain 



1 1 +p 

sup|A# /igco (£;)| < — A 
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which leads to the uniform bound 1 A (1 +p). 

In Lemma 2.5 we obtain Ai = 1 —p, A2 = 00, and hence the lemma does not 
give informative bounds. Lemma 2.4 gives, with F(j) = (1 —p) 3 , the bound 
|ff/, g eo(i)| < ijZpp(™n{ln(j), |} + j^). However, with (2.6) and / G B we 
may bound directly 



b/, g eo(i + i)i< (1 _^ ! 1(j . +1)!fc Ea-^ 



1 ^ 1 



j + 



-Y(i- P ) k = - -. 

1 to P(3 + !) 



and using that 5/, ge o(0) = we obtain the improved bound ||<7/,geo|| < |- To 
our knowledge this bound is new. 

In [13] the author considered another Stein operator. Hence the results 
cannot be compared. In [14] the authors considered the same Stein operator. 
They obtain supy eB | Agf jgeo (k)\ < p Hence [5] already improved this result. 



3. Size-bias couplings and Gibbs measures. 



3.1. Size-bias couplings and Stein characterizations. Stein's method is 
most powerful in presence of dependence. To treat global dependence, cou- 
plings have proved a useful tool. For discrete, nonnegative random variables, 
so-called size-bias couplings fit nicely into our framework as they link in with 
unit per capita death rate generators. 

For any random variable X > with EX > we say that a random 
variable X* , defined on the same probability space as X, has the X -size- 
biased distribution if 

(3.1) EXf(X)=EXEf(X*) 

for all functions / such that both sides of (3.1) exist. Size-bias couplings 
(X, X*) have been studied in connection with Stein's method for Poisson 
approximation (see [4], e.g.) and for normal approximations (see [9], e.g.); a 
general framework is given in [8]. 

If X is discrete, then for all x we have P(X* = x) = -^P(X = x). This 
illustrates that size biasing corresponds to sampling proportional to size; the 
larger a subpopulation, the more likely it is to be contained in the sample. 



Example 3.1 (Poisson distribution). If X ~ Po(X), then from the Stein 
operator in Example 2.10 we read off that X* = X + 1 has the Po(A)-size- 
biased distribution. In [4] a related coupling is used, namely the reduced 
size-biased coupling (X, X* ) , with X* = X* — 1 . 
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Example 3.2 (Bernoulli distribution). If X ~ Be(p), it is easy to see 
that X* = 1 has the 5e(p)-size-biased distribution, for all p£ (0,1]. As an 
aside, this shows that X* does not uniquely define X. 

Example 3.3 (Sum of nonnegative random variables). Let X\, . . . ,X n 
be nonnegative, EXi = m > 0, W = £f =1 X h EW = /x, Var(W) = 1. Gold- 
stein and Rinott [9] give the following construction, valid also in the presence 
of dependence. Choose an index / from {1, . . . , n} according to P{I = i) = 
If / = i, replace Xj by a variate X* having the Aj-size-biased distribution. 
If X* = x, construct Xj, j ^ i, such that 

C(Xj,j + i\X* = x)= C(Xj,j ^ i\Xi = x). 

Then 

(3.2) W* = Y, x j + x i 

has the IV-size-biased distribution; see [9], construction after Lemma 2.1. 

As in the Poisson case, the size-bias coupling can be used to derive a 
characterization of a Gibbs measure, as follows. 

Lemma 3.4. Let X > be such that < E(X) < oo, and let \i be a 
discrete Gibbs measure given in (1.5). If X ~ /i and X* having the X-size- 
biased distribution is defined on the same probability space as X , then for 
every function g : No — > M such that E\Xg{X)\ exists, 

(3.3) coEe v ^ x+1 ^ v ^g(X + 1) = uEe v ^- v ^ Eg(X*). 

Proof. In view of (2.3), for any X > with EX > and (X, X*) a 
size-biased coupling we have, for any bounded function g : No — > M 

E{ue v ( x+ V- v Wg(X + 1) - Xg(X)} 

= E{ioe v{x+1) ~ v{x) g{X + 1) - EXEg(X*)}. 

For X having distribution (1.5), the expectation is given by 

EX = LoEe v ^- v ^ x l 

From EX > it follows that The result now follows from the Stein 

characterization (2.3) of (1.5) for □ 

Indeed in (3.3) the factor to cancels. However, in a moment we shall relate 
(3.3) to the Stein equation. In order not to get confused about the solutions 
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for the Stein equation with and without the u involved, we have decided to 
keep the to. 

Lemma 3.4 provides a new formulation of the Stein approach (2.2): Let 
W > with < EW < oo. If W has distribution /i, then for all / £ B 

Ef(W)-fi(f) 
(3.4) 

= u{Ee v( - w+i y v Wg f)V (W + 1) - Ee v( - w+1 ^ v ^ Eg fy (W*)} , 
where <7/,y, given in (2.5), is the solution of the Stein equation (2.4). 

3.2. Comparison of distributions via their generators. From (2.2), for 
any random variable W and any measurable set A we can find a function 
f = such that P(W € A) - fi(A) = EAf(W), where A is the generator 
associated with the target distribution fi. Applications of Stein's method 
usually continue by bounding the right-hand side EAf(W). Nevertheless, 
Stein's method can also be employed to compare two distributions by com- 
paring their generators. While generators may in general not be available, 
for any discrete distribution we have implicitly constructed a generator in 
(1.9). It is from this angle that we shall bound distances between Gibbs 
distributions. Assume that hi and \i2 are two distributions with generators 
A\ and A2, respectively. Then, for W ~ fi2, EA2f fJ , 1 (W) = 0, and therefore 

m(A) - P(W eA) = EAifv (W) - EA 2 U, (W) = E(Ai - A 2 )U 1 (W). 

Hence we can compare two discrete Gibbs distributions by comparing their 
birth rates and their death rates, as follows. 

Theorem 3.5. Let ii\ have generator A\ as in (1.9) and corresponding 
(ui,Vi), and let (12 have generator A2, and corresponding (UJ21V2), both de- 
scribed in terms of unit per capita death rates. Suppose that T>(Ai) = T>(A2) ■ 
Then, for X2 ~ fJ-2, f 6 B, if g/y 1 is the solution of the Stein equation for 



Ef(x 2 )- J fdm 

<min(||<7 /iVl ||£(X 2 ] 



(3.5) x ( N-^al , ^l E UVi(x*)-Vi(x*-i))-(v 2 (x*)-v 2 (x*-i)) _ ^ 

\ it>2 ^2 
||»/,V 3 ||^l) 

( l^ 2 "^ 1 ! + ^l E \ e (V2(X* 1 )-V 2 (Xt-l))-{V 1 {Xl)-V 1 {Xt-l)) _ j| 
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Proof. For X 2 ~ fJ, 2 , / G B, if is the solution of the Stein equation 
for m, we have Ef(X 2 ) - = ^ l5/iVl (X 2 ) = - A 2 )g f , Vl (X 2 ). 

Using the size biasing and (3.3) we obtain 

Ef{X 2 )-^{f) 

= Eg m (X 2 + l){^e v ^ +l ^ v ^ - u 2 e v ^ x * + V- v ^) 

= uxEg f vi (X 2 + i) e ^( x 2+i)-V2(X2) e v 1 (x 2 +i)-v 1 (x 2 )-(V2(X2+i)-V2(x 2 )) 

- E{X 2 )Eg m (XZ) 

= ^-E(X 2 )Eg m (XfieWXZyviW-iV-iWxV-WS-i)) 

L0 2 

- E(X 2 )Eg m (X* 2 ) 

= ^^E(X 2 )Eg fyi (X*) 

UJ 2 
L0 2 

Taking absolute values together with the triangle inequality, and observing 
that the argument is symmetric in the indices 1 and 2, we obtain (3.5). □ 

Remark 3.6. Theorem 3.5 has the advantage that the partition function 
Z may not be needed to assess the distance. 

Remark 3.7. Note that it is not surprising that ||<7/vi|| appears in the 
bound; even if we compare two Poisson distributions, with different means 
Ai and A2, the bound would be of the type |Ai — A2I ||<7/,vi II- 

When V(Ai) / V(A 2 ), we first extend .Ai to A±, defined for functions on 
{0, 1,2,.. .} as in (2.15), and apply then Lemma 2.9. For convenience assume 
that supp(/xi) = {0, 1, . . . , n} and snpp(fi 2 ) = {0, 1,2,.. .}. Using similar cal- 
culations as for Theorem 3.5, we derive the following result. 

Corollary 3.8. Let [i\ have generator A\ and corresponding (u>i,Vi), 
and let fi 2 have generator A 2 , and corresponding (u> 2 ,V 2 ), both described in 
terms of unit per capita death rates. Suppose that supp(^i) = {0,1, ... ,71} 
and that supp(fi 2 ) = {0, 1,2, . . .}. Then, for X 2 ~ fi 2 and f £ Bo as in (2.17), 
if gfyi * s the solution of the Stein equation for fi\ as in (2.15), 

Ef(x 2 )- J fdm 



<min \\g LVl \\E(X 2 ) 
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L0 2 L0 2 



(3.6) 

\\gm\\E(Xi) 

oo 

+ Mi(/) E M*0- 

fc=n+l 

The extra term ^i(f)J2T=n+i ^(k) m the bound (3.6) as compared to 
(3.5) accounts for the extension A\ of the generator A\. 

Proof of Corollary 3.8. We proceed as in the proof of Theorem 
3.5. Let A\ be the extension of A\ as in (2.15). For X 2 ~ fi 2 , / £ £>/, if 9f,Vi 
as in (2.16) is the solution of the Stein equation for /xi, we have 

Ef(X 2 )-nx(f) 

= EA igm {X 2 ) 

= E(Ai-A 2 )g m (X 2 ) 

= Eg m (X 2 + l^e^+V-^^ - ^eVMD-V.ix^ 
n-l 

= £ P (X 2 = k)g m (k + l)(o, ie ^(fc+ 1 )-^(*) - a; 2 e y2 ( fc+1 )- y2 <*)) 

fc=0 



where we used (2.16). Now the summand 

J2 P(X 2 = k)g m (k + lX^e*^ 1 )-* W - u 2 e v ^- v ^) 

k=0 

can be treated exactly as in Theorem 3.5; from Lemma 2.9 we have the same 
bounds on the solution of the Stein equation when k <n. For the second 
summand we have 



Mil/). . „V 2 (k+l)-V 2 {k) 



k=n 



ujouiif) V -e V2{k) ^ e V2(k+i)-v 2 (k) 



20 



P. EICHELSBACHER AND G. REINERT 



oo 

=mi(/) E 

k=n+l 

Thus we obtain (3.6). □ 

Note that the bound in Corollary 3.8 contains the measure \i2 explicitly; 
hence examples will require the calculation of the normalizing constant Z<i- 
We now treat the instructive example that the two generators A\ and A2 
have the same birth and death rates, but live on different domains. 

Example 3.9. Suppose that the distributions m and \xi have supports 
supp(//i) = {0, 1, . . . , n} and supp(//2) = {0, 1,2, . . .}, respectively. Let fii 
have generator Ai and corresponding (tv,V), and let [12 have generator A2, 
and corresponding same V), both described in terms of unit per capita 
death rates, so that the two generators have the same birth rates and death 
rates for k = 0, . . . , n. Then, for X2 ~ ^2 and f £ Bo, if gfy 1 is the solution 
of the Stein equation for [ii, it follows from (3.6) that 

00 

</ii(/) E ^(*0- 

k=n+l 

In particular it follows that 

00 

drv(^i,//2)< E M2(fc), 

k=n+l 

stating that the difference between the two distributions can be bounded by 
the total mass of the domain, under the respective distribution, which is in 
the support of the one distribution, but not of the other distribution. 

Indeed the bound is sharp, as from (1.2) we have that n\{k) = afj,2(k) for 
k = 0, 1, . . . ,71, with a = (J2k=o ^{k))^ 1 > 1. Hence 

n 00 00 

rfTv(w,/i2) = lX!( Q - 1 )' i 2( fc )+l E A»2(fc)= E ^ k )- 

k=0 k=n+l k=n+l 

Theorem 3.5 and Corollary 3.8 invite the study of the total variation 
distance of discrete probability distributions with different supports of the 
corresponding operators, for example considering the distance between a 
binomial and a hyper geometric distribution. As this leads away from the flow 
of the paper, we do not pursue it here. Instead we turn to the application 
of our approach to Gibbs measures arising in interacting particle systems. 



Ef(x 2 )- J fdm 
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4. Application to lattice approximation in statistical physics. Now we 

apply our results to the model studied by Chayes and Klein [6] . Their setup 
is as follows. Assume A C M. d is a rectangle; denote its volume by |^4|. Con- 
sider the intersection of A with the ci-dimensional lattice n~ lr L d . For each 
site m in this intersection we associate a Bernoulli random variable X^ 
which takes value 1, with probability p^, if a particle is present at m and 

otherwise. If the collection (X^) m is independent, the joint distribution 
can be interpreted as the Gibbs distribution for an ideal gas on the lattice. 
The Poisson convergence theorem states that, for n going to infinity, when 
preserving the average density of particles the lattice ideal gas distribution 
converges weakly to a Poisson distribution, which is the standard Gibbs dis- 
tribution for an ideal gas in the continuum. On physical grounds one expects 
that a similar result might hold for interacting particles. The model is as 
follows. Pick n G N and suppose that A can be partitioned into a regular 
array of d{n) sub-rectangles {-S™, . . . , S^ n ^} with volumes v{Slf n ) = Here, 

> for each m, n. Thus 

d{n) 

1^1 = ~ X] Z rn- 
n m=l 

As a guideline motivated by [6] we choose zl^ > and z n such that z^ — > 
and z n — > z > for n — > oo. For each m, n choose a point G S^. Now the 
following class of functions is considered. 

Assumption 4.1. Let (fk)k be a sequence of functions satisfying /o = 

1 and for each k>l, fk(x\, . . . , Xk) is a nonnegative function, Riemann 
integrable on A k , such that is a symmetric function for each k; that is, 
for any permutation a, fk{x a (i), ■ ■ -,x a ^) = fk(xi, . . -,x k ). 

Let X^ , 1 < m < d(n) , be 0-1 random variables with joint density function 
given as follows. If a±, . . . , aM n ) are such that a, G {0, 1}, and if 

d{n) k 
m=l m=l 

is the sum of the nonzero a^s, then 

P{Xi = a 1 , . . . , X^ n ) = a d(n) ) 

d(n) 

= K(d(n)r 1 f k (ql,...,ql)l[(z^r-, 

m=l 

where the normalizing constant is 

d(n) 

K(d(n))= £ ^l {Efl , =t} A«,..,<)IlK) a ™. 

ae{0,i} d ( ?l ) k m=i 
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Define 

d(n) 

o \ yn 

Jn — ^m- 

m=l 

Then, due to the symmetry of fk, 

{Zn) k ^d(n) v d(n) , ( n „mt / qn \ 

(4 1) P(S = k) = kl il=1 ^ifc=i /fc ^n'---' g »fc^^=i u K D i m ) 

y^rf(n) (z n ) k y^d(n) ^ci(n) „ , n n n rrfc / qn \" 

Z^fc=0 fel 2^ii=l ' "2^i fe =i JkVli^ ■ ■ ■ ,Qi k ) ll m =l "v^nJ 
Note that we can write (4.1) as a Gibbs measure /x n of the form (1.5) with 

(4.2) LO n = Z n , V n {k)=log(W n (k)), k € {0, 1, . . . d(n)}, 

where 

d{n) d{n) fe 

Wn(k) = E • • • E /*(#.■ •-.«?») A ke{0,..., d(n)}. 

il=l «fe=l m=l 

Let 5 be a nonnegative integer-valued random variable defined by 

(4.3) P(S-k)- T\U^fk{xi,...,x k )dx 1 ---dx k 

We write (4.3) as a Gibbs measure fi of the form (1.5) with 

u, = z , v(k) = io g (w(k)), ke {0,1,...}, 

where 

W(k)= f k (xi,...,x k )dxf-dx k , ke {0,1,...}. 

In [6] the following class of functions is considered. Let (fk)k be a sequence 
of functions satisfying Assumption 4.1 and, in addition: 

(a) There exists a constant C such that f k (xi, ■ • ■ , x k ) < C k for all k > 0. 

(b) fk(xi, ■ ■ ■ ,x k ) = if = for some i / j. This is not a necessary 
condition; see [6]. 

In [6] it is shown that under these additional conditions, S n =>• S, that is, S n 
converges weakly to S , under the conditions that limjj—^oo z n = z and that 

lim ( max z™ ) = 0. 

\l<m<cf(n) / 

For the proof of this convergence, condition (a) on (f k ) k enters to ensure 
that the Riemann sum converges, so that the normalizing constants Z n 
and Z for [i n and for /x, respectively, are finite. Condition (b) avoids some 
measure-theoretic considerations. In [6] a bound on the rate of convergence 
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is given for the special case that fk = 1, all k > 0, resulting in a Poisson 
approximation . 

Here we give a general bound on the distance, not only in the case fj- = l, 
k>0. 

Note that in [6] the generalization of Poisson convergence allows the au- 
thors to develop a lattice-to-continuum theory of classical statistical me- 
chanics; similarly our results could be applied to obtain bounds on such 
lattice-to-continuum theory. 

Using Corollary 3.8 with fii = \i n and ^2 = H-> we obtain 

Ef(S n )- Jfdv 

< m m{E(S n )\\g fy \\0-^^ + -E 

E{S)\\ 9f y n \\(^^ + Z -fE 

00 

E K*0- 

k=d(n)+l 

Writing out the size-biased distribution gives the following result. 




W n (S*)W(S* - 1 
WJS* - l)W(S* 



Theorem 4.2. Let S n ,fj,,z n be as above. Let us assume that (fk)k sat- 
isfies Assumption 4-1 and condition (a). Then 

d T v(C(S n ),fi) 

<m&x(\\g fy \\,\\g f y n \\) 



xmin [E(S n y-^ L + fc/^fc) 



w(k)w n (k-i) 1 



W(k-l)W n (k) 

W n (k)W(k-l) 



W n (k-l)W(k) 



+ /%(/) E M*0- 

k=d(n)+l 

Remark 4.3. Note that for Theorem 4.2, no monotonicity assump- 
tions on the birth and death rates in the corresponding birth-death pro- 
cess are needed. For the bound to be informative, though, we need that 
maxdl^yll, ||<7/,vUI) < 00 ' conditions to ensure this behavior are given in 
Lemma 2.5. 
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For the case of nonincreasing birth rates and unit per capita death rates, 
we also obtain bounds based on the increments Ag. Recall the notation of 
Example 3.3. 

Proposition 4.4. Let S n be the sum ofX\, . . . ,X n , where X; L ~ Be(pi), 
and construct 5* via (3.2), and let fi be given by (1.5). Assume that the birth 
rates in (1.7) are nonincreasing, and assume that the death rates are unit 
per capita. Then 

d T y(C(S n ),fl) 

< W £{ e ^ + l)-nSn) 

(4.4) xmm{S(S n ,S* n -l), 

(\S n - S* + l|/ w ) e V(min(S„,S;-l))-V(min(S„,S;-l)+l)^ 

+ \\g f , v \\uE\e v ^ +i y v ^ -£ e ns«+i)-nSn)| s 

where we put S{x,y) = E^^+i J- 

Proof. With the notation as in Example 3.3, denote the conditional 
expectation given I = i by Ei. Let / G B. If Xi ~ Be{pi), i = 1, . . . , n, then 
X* = l. Put 

s n ,i =E^- 

By (3.4) and (3.2), we have 
\Ef(S n )-»(f)\ 



E Ei[e ViSn+1) - ViSn) 9fASn + 1)] 

i=1 L,j=iPj { 



Ee v(s n+ i)-v(s n ) Egfy + lj I 



E ^^E t [e V ^- V ^{g sy {S n + 1) - g,yiS„., + 1))] 
i=l Lj=lPj 

+ E igf y(S n ,i + l)( e ^(^+l)-V(a») _ Ee V(S n+ l)-V(S n ) : 



< 



E"=iPj i=1 
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x mm{S(S n ,S n> i), 

(\S n - S n>i \/u) 

x e V(min(Sn,3 n ,i))-V(mm(S„,S!n,i)+l)\ § n ^ 

x P(S n + S n ,i) 

+ \\g fy \\ujE\e v ^ + ^- v ^ - Ee v V» +r >- v <- s »\ 
where we used Lemma 2.1 for the last inequality. □ 

Note that in contrast to Theorem 4.2, we did not employ the difference 
between generators in the proof. 

4.1. Poisson approximation for a sum of indicators. In the Poisson case 
we can use the bound (2.20) on Ag instead of Lemma 2.1 to obtain improved 
overall bounds, as follows. Let X{ ~ Be(pi), i = 1, . . . , n; then as in Example 
3.3 we have for S n = Y^j=i^j that S* = S n j + 1, where as above S n> i = 
J2j^i^j- Let A = ES n = Y^j=\Pj- Recall that if fj, = Po(X), then u = A and 
V{k) = —A. Hence using (4.5) we obtain 



\Ef(S n )-Po(X)(f)\ 



^piEi(g fy (S n + 1) - 9f,v(S n ,i)) 



i=i 



Calculating as in the proof of Proposition 4.4 and using (2.20) we get 
d TY (C(S n ),Po(X)) 

(4.5) < ^ K £ 4 {min{s(S n ,S n J, ^!L_M(i _ e -*)}|s n / £ n>i j 

X P(S n 7^ S Hi i). 

In particular we recover the bound for couplings in [4], Theorem l.B, 

i —A n 

(4.6) d TY (C(S n ),Po(X)) < -^—Y^ Pi Ei\S n - S n ,i\. 

i=l 

Example 4.5. Assume that the variables S n and 5* can be coupled 
such that 

(4.7) \S n -S n;i \ < 1, i = l,...,n. 
Then we can improve the bound (4.6), using 

Ei{S(S n , S ni i)\S n S ni i\P(S n 7^ S Uj i) 
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< E 4 

1 + S n>i 

< P(S n ± S n ,i) 



= P(S„^„)i(l + P{S n ., = 0| S ,^S„„}). 

Example 4.6. In case that the Xj's are independent and we have S n 
S n — Xi, so that (4.7) is satisfied, and (4.6) yields that 



(4.8) d TY (C(S n ),Po(\))<J2p 



i=i 



2 l-e~ A 
A 



The bound (4.8) coincides with the bound (1.23) given in [4]. We can improve 
on it by using that 

P{S n ,i = 0\S n ^ S n ,i} = P[J2 Xj =0\Xi = l\= n(l - Pi)- 

This results in the bound for the independent case 



11, T-r,. A l-e" A 



d TV (C(S n ),Po(X)) < J> 2 mm - 1 + - Pj ) ) , 

t=i IV j^i 



A 



To see when the above bound is an improvement on the bound (1.23) given 
in [4], consider the inequality 

I, . 1 - e~ A 
-(! + .)<—. 

We rearrange this inequality to give 

Ax<2- A-2e~ A . 

Numerical calculations yield that the right-hand side is positive when < 
A < 1.59362. For such A and some y such that 0<y<2 — A — 2e~ A we can 
construct strongly inhomogeneous cases such that 

for some indices i. Examples are, for A = 1, the vector p = (pi, . . . ,p n ) = 
(l,0,0,...,0),or the vector p = (p!,...,p n ) = (l-l/(n-l),l/(n- l) 2 ,l/(n - 
l) 2 ,...,l/(n-l) 2 ) for n>3. In such cases, our bound provides an improve- 
ment. 



STEIN'S METHOD FOR DISCRETE GIBBS MEASURES 27 

Example 4.6 can only be an improvement on (4.6) when the underlying 
random variables are not identically distributed; in the independent and 
identically distributed case, we recover exactly the bound given in [4], (1.23), 
page 8. 

4.2. An example with repelling interaction. Our first example with in- 
teraction partitions A = [0,1] into n intervals, the ith interval given by 
Sf = [^rK^]) and we choose = the midpoint of the interval 5". 

Then each S™ has volume ^ = we choose zf = ^ and z n = A. Note the 
freedom of choice in A > 0. We consider the set of functions f k given by 
/o = 1, fi(x) = 1 for all x, and for k > 2, f k (xi, ■ ■ ■ , Xk) = if Xi = Xj for 
some i / j, and otherwise 

f k (xi,...,x k ) = {Xi-Xjf, 
l<i^=j<k 

so that values x\, . . . ,x n which are far away from each other are preferred. 
Then clearly f k satisfies Assumption 4.1 as well as conditions (a) and (b). 
To avoid trivialities we assume that n > 2. 

In our setup we choose 0-1 random variables X^, 1 < m < n, with density 
function 

n 

P(X? = a u ...,X% = a n )ocn~ k \ k £ (tf - q]) 2 if k = £ a m > 2; 

l<ij^j<k m=l 

P(X? = <n,..., X% = a n )(x- iffc = l; 

n 

P(A7 = 0,...,AC = 0)ocl. 

With S n = Em=l X m we have with that 

\k 

P{S n = k)<X—W n {k), 

where 

W n (k) =n- k J2---J2 ■ ■ ■ ><)> k = 0,...,n. 

h=l i k =l 

Let S be a nonnegative integer-valued random variable defined as in (4.3) 

by 

§/ [0!l]fc / fe (a;i,...,x fe )dxi---dx fe A fc w .,. 

P(«S = fc) = oc — W(k), 

Efeto fef I[o,i]h fk{xi,. ..,x k )dxf- dx k 



with 



W(k)= f k (x 1 ,...,x k )dxi---dx k , k>> 

J[o,i] k 
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It is immediate that W(0) = W(l) = 1, and for k > 2, 

k rl rl k(k-l) 



^(*0=EE/ I {xi-xjfdxidxj 
~i~T ; Jo Jo 



6 



Thus our limiting Gibbs measure // has normalizing constant 

1^ \ k A 2 



and is therefore given by 

v2 



M0) = (l + A+^e A y 



^={ i+x+ py^ k> - 2 - 

To assess the distance between the distributions of S n and of S we note that 
W n {0) = W„(l) = 1 = W(0) = W{1), and some algebra yields, for k > 2, that 



^w^e-ee 

i\ =1 ^fc— 1 



2i s - 1 2i 3 - - 1 



2n 2re 



2 



-- fe - 2 E EE(*-;)V=- 

l<Z^s<fci=l j=l 



Thus 



2n" 4 ^-l) £* 2 - £ 

U=i Vi=i 

fc(fc-l)(n + l)(n-l) 
6n 2 

W n (*0 W(fc) 



W n (&-1) W(fc-l) 

for fe = 0,. . . ,n, and we are in the situation of Example 3.9. Therefore it 
follows that 

oo 
k=n+l 

A 2 A" 1 ^ A fc 
1 + AH e A > — 
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A n+l e A 



< 



( n + i)l(i + A + f e A )' 
Summarizing, we have: 

Corollary 4.7. Let S n and S be as constructed above. Then 

A n+i e A 



d TV (C(S n ),fj,) < 



(n + l)!(l + A + f e A ) 



4.3. A second example with interaction. Using the same partition of A = 
[0, 1] and the same notation as in the previous example, and choosing for 
simplicity u = A = 1, we now consider the set of functions {fkjk given by 
fo(x) = fi(x) = 1 for all x, and for k > 2, 

fk(%li • ■ • ) -Ek) = X{Xj. 

l<iy£j<k 

Clearly {fkjk satisfy Assumption 4.1 as well as conditions (a) and (b). We 
take qf = the left endpoint of the interval Sf. Assume that n > 3. 
Along the lines of the calculations in the previous example we obtain now 
W(0) = W(l) = 1 and for k > 2 

W(k) = k~ k , 

so that the distribution of S given in (4.3) is 
We note that 



oo -, 

1< Z = 1 + V -e' klnk < e 1/e . 
— u — 



k=l K - 

For the distribution of S n given in (4.1) we obtain W n (0) = W„(l) = 1, and 
for k>2, 



W n (k)=n 



\i=0 I 



We employ an integral approximation, obtaining 

k~ k <W n {k)<k-\ k>2. 



n 

Observe that for k = 0, . . . , n, 

'n-l\ k \ W n {k) 



< TTT77T" < 1. 



n J ~ W(k) 
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For Theorem 4.2, we may thus bound for k = 2, . . . , n, 
W(k)W n (k-l) 



W(k - l)W n (k) 
< max< 1 



n 



n — 1 



I- 1 






(n- 


- 1 


V n 


k' 1 




- 1. 



n 



\n-l 



Now we calculate 

k 



W(k)W n (k-l) 
W{k-l)W n (k) 



<^k^ n (k) 



n 



n—1 



Taylor's expansion gives the inequality 



0< 



n — 1 



1 < 



k' 2 



1 + 



1 



n — 1 V n — 1 



^!_ P fc 2 /(n-i) < Ji 



< 7 e 

n — 1 



n — 1 



for k <n and n > 3. Applying this inequality, we obtain that 



Y^kfj, n (k) 

k 



W(k)W n (k-l) 



W{k-l)W n {k) 



< 



< 





1 


n—1 
k=l K ' 


(n 


-1)2* 




e e 


2e e 
< — 
n 


(?) 


~l)Zn 



-klog(k)+k 



where we used Z n = 1 + X)S£Li H^ 7 ™^) — 1 m ^ e l as t inequality. Next we 
estimate 



1 ^ 1 



E E ^ fc < E *»- fc ^ 1/B ™ 

k=n+l ^k=n+l K - fc=n+l fc - ^ + Ij. 



where we used that Z > 1. 

Note that A2 = sup fc - — p — = 00, so that we cannot apply Lemma 2.5; 
instead we find an alternative bound on ||<?/ 5 y|| as follows. From (2.6) we 
have 



\9f,v( 



0- + i)i<^ e -^) £ ^<E^- fc = ^ 1/e - 



Corollary 4.8. Let S n and S be as constructed above, and let n> 3. 
Then 



d TY (C(S n ),fi)<2 



e+l/e -(n+1) 
£ + e V"2 



n 



(n+1)!' 
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